Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📊 Model Evaluation
Benchmarking, Performance Metrics, Testing, Validation
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
122411
posts in
1.57
s
Constructing
Industrial-Scale Optimization Modeling
Benchmark
arxiv.org
·
14h
🎛️
Fine-Tuning
Analysis of systems with dependent components through a
variance-based
index and
regression
importance signature
sciencedirect.com
·
3h
🎯
AI Alignment
Benchmark
&
Compare
the Best AI Models
arena.ai
·
1d
🤖
AI
Systematic Parameter Decision in
Approximate
Model
Counting
chipublib.idm.oclc.org
·
2d
✍️
Prompt Engineering
Versioning
and Testing Data Solutions:
Applying
CI and Unit Tests on Interview-style Queries
kdnuggets.com
·
1d
🛡️
Red Teaming
Benchmarking
Large Language Models for Knowledge Graph
Validation
arxiv.org
·
14h
⚖️
AI Governance
Part 2 - AI Chat Evaluation of the Formal Language in He
Xin
's
PEPC
System
news.ycombinator.com
·
1d
·
Discuss:
Hacker News
✍️
Prompt Engineering
From
Backend
Engineer to AI Engineer: A Practical
Roadmap
(No Hype)
dev.to
·
54m
·
Discuss:
DEV
✍️
Prompt Engineering
What Agentic AI "Vibe Coding" In The Hands Of
Actual
Programmers
/ Engineers
stochasticlifestyle.com
·
7h
🤖
AI
I
benchmarked
4 CLI coding agents on an
NP-hard
optimization problem I solved by hand 8 years ago. One of them beat me.
charlesazam.com
·
4h
·
Discuss:
Hacker News
✍️
Prompt Engineering
The ODE ( O
verview
, D ata, and E
xecution
) protocol for a standardized use of machine learning in environmental,...
sciencedirect.com
·
8h
⚖️
AI Governance
Task 2: Refactor
SimulationConfig
for
DSGE-HA
· Issue #15
github.com
·
7h
💭
Context Management
How to
Leverage
Explainable
AI for Better Business Decisions
towardsdatascience.com
·
4h
⚖️
AI Governance
Beyond the
Prompt
- Why and How to
Fine-tune
Your Own Models
devblogs.microsoft.com
·
1d
🎛️
Fine-Tuning
part 4: Infrastructure services platform
microservices.io
·
11h
🤖
Agent Architectures
SotA
ARC-AGI-2 Results with
REPL
Agents
symbolica.ai
·
10h
·
Discuss:
Hacker News
⚖️
AI Governance
Feedback
Control for Computer Systems
janert.org
·
12h
✍️
Prompt Engineering
Show HN: A
header-only
C++ benchmark for predictive models on raw
binary
streams
github.com
·
11h
·
Discuss:
Hacker News
🛡️
Red Teaming
[
AINews
] Z.ai GLM-5: New
SOTA
Open Weights LLM
latent.space
·
12h
🤖
AI
From 97% Model Accuracy to 74% Clinical Reliability: Building
RSN-NNSL-GATE-001
dev.to
·
3h
·
Discuss:
DEV
🎯
AI Alignment
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help